Sequential multi-agent exploration for a common goal
Abstract
Motivated by applications in Dynamic Spectrum Access Networks, we focus on a system in which a few agents are engaged in a costly individual exploration process where each agent’s benefit is determined according to the minimum obtained value. Such an exploration pattern is applicable to many systems, including shipment and travel planning. This paper formally introduces and analyzes a sequential variant of the general model. In that variant, only a single agent engages in exploration at any given time, and when an agent initiates its exploration, it has complete information about the minimum value obtained by the other agents so far. We prove that the exploration strategy of each agent, according to the equilibrium of the resulting Stackelberg game, is reservation-value-based, and show how the reservation values can be calculated. We also analyze the agents’ expected-benefit-maximizing exploration strategies when they are fully cooperative (i.e., when they aim to maximize the expected joint benefit). The equilibrium strategies and the expected benefit of each agent are illustrated using a synthetic homogeneous environment, thereby demonstrating the properties of this new exploration scheme and the benefits of cooperation.
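To make the idea of a reservation-value-based strategy concrete, the following is a minimal sketch and not the paper's derivation. It rests on simplifying assumptions that do not come from the abstract: the agent is the last one in the sequence, it draws values independently from Uniform(0, 1) at a fixed cost per draw, it may keep its best draw, and its benefit is capped by the minimum `m` obtained by the earlier agents. The function names `expected_gain` and `reservation_value` are hypothetical. Under these assumptions the agent keeps exploring while its best value is below the threshold `r` at which the expected gain from one more draw equals the cost.

```python
import math


def expected_gain(r, m):
    """E[max(min(m, Y) - r, 0)] for Y ~ Uniform(0, 1), valid for 0 <= r <= m <= 1.

    This is the expected improvement from one more draw when the agent
    currently holds value r and anything above the cap m is worthless.
    """
    d = m - r
    return 0.5 * d * d + (1.0 - m) * d


def reservation_value(m, cost):
    """Threshold r solving expected_gain(r, m) == cost (assumed uniform values).

    Returns None when even an agent holding value 0 would not gain enough
    from a draw to cover the cost, so no threshold in [0, m] exists.
    """
    if cost > expected_gain(0.0, m):
        return None
    # Closed form of the quadratic 0.5*d^2 + (1 - m)*d = cost with d = m - r.
    return 1.0 - math.sqrt((1.0 - m) ** 2 + 2.0 * cost)
```

In this sketch a lower cap `m` lowers the threshold, so an agent that learns the earlier agents obtained a poor minimum stops exploring sooner. With no effective cap (`m = 1.0`) and `cost = 0.02`, the threshold is approximately 0.8.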
Similar resources
Coordinated Exploration with a Shared Goal in Costly Environments
The paper studies distributed cooperative multi-agent exploration methods in settings where the overall benefit of an opportunity is the minimum of individual findings and the exploration is costly. The primary motivation for the model is the multi-channel cooperative sensing problem which draws from the inter-vehicular cognitive offload paradigm. Here, vehicles try to coordinate an offload cha...
Coordinated multi-agent exploration
Many successful robotic systems use maps of the environment to perform their tasks. In this paper, we propose a cooperative exploration strategy for multi-agent robots. This proposal is a parallelization of the basic SRT method, to which the following functionalities were added: cooperation to increase the efficiency, coordination to avoid conflicts and communication to cooperate and to coordinate...
The Exploration-Exploitation Tradeoff in Sequential Decision Making Problems
Sequential decision making problems often require an agent to act in an environment where data is noisy or not fully observed. The agent will have to learn how different actions relate to different rewards, and must therefore balance the need to explore and exploit in an effective strategy. In this report, sequential decision making problems are considered through extensions of the multi-armed ...
Analysing the Behaviour of Robot Teams through Relational Sequential Pattern Mining
This report outlines the use of a relational representation in a Multi-Agent domain to model the behaviour of the whole system. A desired property in these systems is the ability of the team members to work together to achieve a common goal in a cooperative manner. The aim is to define a systematic method to verify the effective collaboration among the members of a team and comparing the differe...
Cooperative Control of Mobile Robots in Creating a Runway Platform for Quadrotor Landing
Multi-agent systems are systems in which several agents accomplish a mission in a cooperative manner. In this paper, a novel idea for the construction of a movable runway platform based on multi-agent systems is presented. It is assumed that an aerial agent (quadrotor) decides to make an emergency landing due to reasons such as a decrease in energy level or technical failure, while there is no ...
Journal: Web Intelligence and Agent Systems
Volume: 11
Publication date: 2013